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1.  Introduction 

This  ■lavandKiii  preecoba  aoma  d^anieaLE  ajratama  trhoaa  ataat^^atAta 

.'V  *  ■  , 

aplutiona  yield  aolutiona  to  a  diacreta,  ^aaa  matrix.  First  ^  consider  a  fasr  - 
aystema  of  differential  aquations  for  tb:.  ease  of  a:syimbrlc  game,  than  sdra'e  aora. 
or  less  related  eyateas  Of  difference  eqtiatidoa  for- distal  computation.  Finally^ 
we  ^ve  for  the  sake  of  the  record  a  representation  of  the  linear  profrioldnE 
problem  as  a'aymetric  game.  This  representation  serves  also  to  repi^saot  > 
asynmetric  games  in  symmetric  fom. 

2.  Differential  Bouationa  for  Brnsnetric  Qamas 

The  value  of  a  aymaftric  game  with  matrix  i>  zero,  and  l^e 

vector  l!,.ia  a  solution  if  and  only  if  0  and  1. 

Consider  the  differential  system. 


(1) 


i  l  - 


1,2, . • • •  ,m; 


x^  i  0,  some  x^  •>  0. 

Since  the  derivatives  are  all  non-negative  it  followe  from  the  initial  conditions  , 
that  Xj^  !i.O  and  ^Xj^  >  0,  for  all  t,  so  that  the  normalized  vector  ^  •  ViT*  • 
can  always  be  obtained,  fhis  system  provides  asymptotically  a  aolution  of  A  in  the 
senas  that  lim  sup  ^  0  for  all  i.  The  proof  which  follows  will  contain  a 

fairly  preciee  description  of  the  convergence* 


Setting  Uj  “  dxj^/dt,  it  followe  that  if  Uj  >  0  then  dDj/dt  ■  T*  \ , 

so  that  Uj^  dUj^/dt  -  O^^La^jUj,  in  any  case.  Suatnlng  on  i  »re  got  d/dt  Uj 

O  _  «  _  «  Li. 


•0, 


since  Aj^j  *  -  Aj^.  In  other  words  ^0^  -  k^.  But  51^^,  hence 

d/dt  ■  5!.Ui  ^  k.  Therefore  Exjb  Xxf  kt.  Setting  dj  ■  max 

after  normalizing  x  to  get  we  have 


/ 


Ij£.  ^ 


If  k  ■  0  we  have  a  eblution  to  the  gaae  based  on  xj,  and  if  k  /  0  we  can  write 

vhere  i  ■  (Zrdi)^  at  t  -  0. 

. 

Thus*  if  oonvergence  to  a  solution  be  measured  by'(Zldi)^  the  minimum  speed  of 

convergence  is  independent  of  the  order  of  A.  Since  mZ.U^  it  follows 

\ 

that  the  maxioram  speed  of  convergence  is  still  of  the  same  order.  If  convergence 
is  aeasured  by  root  mean  square  values  then  t  is  replaced  t  in 

the  corresponding  fonmila  bounding  the  error. 

For  comparison  consider  the  much  simpler  system  of  equations 
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otherwise 


■  0|  ^  0,  some  >  0. 


Where  system  (1)  says  to  increase  the  play  of  the  "profitable"  strategies,  at  a 
rate  proportional  to  their  profits,  system  (II)  says  to  increase,  at  a  constant 
rate,  the  play  of  the  moat  profitable  strategy  (against  the  current  mixed  strategy). 
In  this  case  it  can  be  shown  that  max  (]EIa<  jX,  )  is  constant.  Crudely, 

i  J 

the  argument  is  as  follows:  if  Z.A^jXj  is  a  strict  maximum,  increasing  Xj  alone 
does  not  alter  XAjjXj,  but  alters  some  or  all  of  the  remaining  ^TAjjXj  until  some 
Za^  jXj  »  jXj.  At  this  point  increases,  which  does  not  alter 

but  drives  Za^  jXj  downward,  since  Aj  ^  <  0  by  virtue  of  the  fact  that 
Aj^  and  the  fact  that  Increasing  Increased  ZAj  jXj.  But 

Zxj^i-  ZxJ  +  t,  therefore,  -when  the  x’ s  are  normal  1  zed,  max  (  ZAj^j  ^j)  » 

where  max  (Za^jX^)  -  k.  If  ZxJ  »  1,  then  max  ~  1'+  t  '  should  be 

pointed  out  that  both  equations  (I)  and  (II)  can  easily  be  converted  to  systems  of 
equations  in  ^  (which  will  now  not  even  appear  linear  in  ) .  It  is  of  some 
interest  that  the  order  of  the  convergence  is  the  same  for  (11)  as  for  (I). 


->•  • 
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A  possible  variant  of  (l)  Involves  decreasing  the  poor  strategies  as  well 
as  increasing  the  good  ones^  imposing  only  the  conditions  ^  0.  This  leads  to 
the  systen 


subject  to  iuO; 


(111) 


xj  2  0;  some  3^  ?  0. 

Without  the  restrictions  Xj^  ^  0  this  is  an  undan?>ed  oscillatory  system. 

Even  with  the  restrictions  ^3^  is  constant.  Since  the  ■  dx^^/dt  satisfy  the  same 
system  of  equations  in  an  interval  during  idilch  no  positive  x’ s  become  zero  and  no 
vanishing  x's  become  positivej  it  is  seen  that  is  also  constant  within  such 
intervals.  What  might  permit  the  aystem  to  run  down  is  that  when  some  Xj|^  hits  zero 
from  abovei  with  negative  velocity,  then  is  suddenly  diminished  by  the  i^quare 
of  the  "striking"  velocity.  On  the  other  hand,  since  ZAj^jXj  Is  continuous  no 
increase  in  i®  obtained  rvhen  a  coordinate  goes  positive,  the  coordinate  always 
leaving  the  barrier  with  zero  velocity.  The  convergence  properties  of  this  system 
are  not  known.  Shapley  has  pointed  out  that  if  the  game  has  a  unique  solution, 
involving  all  strategies  with  positive  probability,  then  convergence  is  impossible. 

In  any  case,  this  system  is  strikingly  different  from  the  systems  (I )  And  (11). 

A  general  point  about  the  systems  considered  above  is  that  this  discussion 
has  assumed  implicitly  the  existence  of  solutions,  in  .some  sense  or  other  well-behaved. 
No  formal  analysis  has  been  made  of  the  existence  of  such  solutions. 

3.  Some  Discrete  Processes 

The  work  of  the  previous  section  was  actually  motivated  by  the  consideration 
of  the  ‘'ollowing  iterative  process  for  the  general  discrete  game  Aj^j,  not  necessarily 
symmetric.  Start  with  Xj(0)  0,  some  x^(C)  >0,  yj(0)  -^0,  some  >  0.  Sot 

x^(n+l)  -  xj(n)  +  1  if  ZAijyj(n)  ^  ZAj^jyj(n)  for  all  k,  otherwise  sot 
x^(m-l)  *  x^(n),  and  set  y^(n)  -  y^(n-l)  +  1  if  Zxj(n)A^j6  Xxj(n)A^p 


for  all  p. 


>4- 
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othendse  set  y^in)  •  y^Cn-l).  This  procedure  Corresponds  to  having  each  player 

choose^  in  turn,  a  strategy  which  is  optig^ It  against  the  opponent's  mixture, 

1 

cuoulated  to  date.  Various  conventions  can  be  adopted  for  cases  when  the  optimum 
is  taken  on  simultaneously  at  toore  than  one  pure  strategy.  A  minor  variant  is 
obtained  by  having  the  pairs  of  strategies  for  each  side  chosen  simultaneously,  rather 
than  in  turn,  i.e.  by  choosing  y^(n+l)  »  y^(n)  ♦  1  if  Zx^(n)Aj^j^r  Hxj(n)Ajp  for 
all  p,  instead  of  y^(n)  ■  y^(n-l)  ♦  1,  as  above. 

Actual  computation  by  this  scheme  is  very  easy,  in  that  only  addition  and 
choice  of  maximum  are  involved.  At  each  stage  a  row  (or  a  coltunn)  of  A  is  selected, 
and  cuswlated  into  the  previous  vector  sum,  the  choice  of  row  or  coliium  being 
recorded.  Of  course  the  x's  and  y's  can  be  normalized  at  any  stage,  with  corresponding 
minima  and  suxima  being  less  than  and  greater  than  the  value  of  the  game,  respectively. 
Actually  convergence  to  the  value  of  the  game  is  not  strictly  needed,  since  one 
only  requires  that  the  limita,  superior  and  inferior,  respectively,  equal  the  value 
of  the  game. 

No  non-trivial  properties  of  this  iteration  have  been  established.  It 
le  obvious  that  if  the  mixtures  converge  they  converge  to  eolutions  of  the  game.  For 
the  2x2  game  it  can  be  shown  easily  that  the  error  is  less  than  c/n  at  an  infinite 
number  of  values  of  n. 


Consider  now  the  special  case  of  this  method  obtained  by  taking 
Ajj  ■  -  Ajj^,  with  initial  values  Xj^(O)  »  yj(0),  taking  also  the  variant  in  which 
successive  pairs  of  strategies  are  chosen  simultaneously.  Then  x^(n)  «  yj(n),  so 
that  only  one  side  need  be  worked.  This  process  is  equivalent  to  forming ZA^jXj(n) 
by  successive  addition,  adding  in  one  unit  of  the  strategy  Xj^  if  Z^A^jXj  is  maximal. 
This  is  the  discrete  equivalent  of  system  (11)  discussed  above.  The  only  difference 


is  that  me 


[rAijXj(n)] 


does  not  now  remain  constant,  except  while  the  maximal 


strategy  remains  maximal.  When  a  new  strategy  becomes  maximal  the  maximum  increases. 


-5- 

P-78 


The  Juoqp  is  due  to  the  dlacretenese,  and  becomes  relatively  less  important  further 
out  in  the  computation^  so  that  one  would  not  expect  this  system  to  look  too  saich 

different  from  the  continuous  system  (11) »  at  least  asymptotically.  This  raises 

/ 

the  question;  how  much  weight  should  be  attached  to  a  given  initial  approxlination> 
i.e.,  how  large  should  2Ixj^(0)  be  taken?  If  Zx^CO)  is  taken  largS)  then  more 
steps  are  required  to  accomplish  changes,  but  the  relative  effect  of  the  occasional 


inct^ases  in  max 


is  less* 

It  seems  reasonable  to  hope,  on  the  basis  of  the  behavior  of  system  (II) » 
that  the  discrete  methods  presented  here  will  have  essentially  the  same  order  of 
cj)nvergencsi  with  no  ssssntial  dependence  on  the  order  of  the  game  matrix.  If  this 
is  actually  the  oasei  then  the  amount  of  labor  involved  in  computinf  will  vaiy 
essentially  linearly  with  the  order  of  the  matrix,  ft^plrical  experience  with  the 
method  seems  consistent  Kith  this  hope. 


It  is  of  some  interest  to  observe  that  the  linear  system  Ax  »  y  can  be 
represented  directly  by  a  game,  provided  A  ie  non-eingular  and  x  >  0,  whose  eolutione 
for  one  of  the  players  all  yield  solutions  to  the  given  system  of  equations.  Applying 
the  iteration  above  corresponds  to  adding  columns  and  adding  or  subtracting  rows  of 
the  augmented  matrix  -  yj  ,  in  tpm,  according  to  the  rule;  add  the  column  . 
corresponding  to  the  smallest  conq>on«it  of  the  row  sum,  and  subtract  or  add  the  row 
correeponding  to  the  maximum  component,  in  absolute  value,  of  the  column  sum,  according 
to  whether  that  component  is  positive  or  negative.  Relative  frequencies  of  occurrence 
of  the  columns,  relative  to  the  frequency  of  occurrence  of  the  last  column  approximate 
to  the  solution  of  Ax  *  y. 

4.  Reduction  to  Syianetric  Games 

The  results  of  this  section  are  based  partly  on  some  results  of  Tucker 


and  on  conversation  with  Dantzig.  Later  conversation  with  Shaploy  crystallized 
some  of  these  results,  which  by  now  have  been  obtained  in  this  form  by  Tucker.  The 
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/  purpose  of  this  section  is  to  g«t  *^sai  bn  the  'record.. at  RAlIU)* 

*  '  •  '  .  .  .  •  ■  V 

Consider  a  sysnetrlo'i;ane  constructed  as  foUowst 


B  -u 


uV  -A 


0^  ^ 


where  B  Is  ext,  u  Is  sxl,  v  is  txl,  and  primes  denote  transposition.  0^  and  '' 

0-  are  composed  of  aero a,  to  fill  out  the  matrix.  The  value  of  A  is  zero,  end  the 

/3f\ 

vector!  jt  is  a  solution  of  A  if  and  only  if  . 

Bx  >  9u  ^  0,  -B*y  +  dr  -4  0,  and  u*y  -  t'x^O, 
where  y  la  sxl,  x  la  txl,  and  0  is  a  scalar,  aith  xi  0,  y  0,  ©"^O,  and 
X^c  +  Xy  0  ■  1,  Villo's  theorem  states  that  the  maximum  of  0v*x  subject  to 
Bx  -  00  0,  X  0,  ec[ud.s  the  minimum  of  0u’y  subject  to  B*y  -  0v  0,  y  ^0. 

Hence  0T’xir  0u*y  But  u*y  i  v'x,  so  that  v*x  ■  u»y  if  0  0,  and  -g  maximizes 

v'£,  subject  to  B6,  ^  u  and  <:0,  and  ^  minimizes  u*^  subject  to  ^  v 
and  ^  ^2,  0.  In  other^words,  a  solution  of  A,  in  which  the  last  row  enters  with  positive 
probability,  provides  eiaiultaneous  solutions  to  the  linear  prograianlni;~problea  and 
its  dual.  Conversely,  it  is  easy  to  see  that  solutions  to  the  two  dual  programming 
problems  provide  a  solution  of  A  in  which  the  laet  ^w'  appears  with  positive 
probability.  /lA  /I) 

Lettins  u  ■  ‘  and  v  *  i  ,  It  appears  immediately  that  the  linear 

vJ  li/ 

programing  problems  have  become  the  game  problem,  so  that  solutions  to  the 
symmetric  game  A  provide  eolations  to  the  game  B  provided  the  value  of  B  ia  positive. 

0  becomes  V(B)/[V(B)  +  ^ 


/ 


